An evaluation of the accuracy and speed of metagenome analysis tools

Authors

  • Stinus Lindgreen
  • Karen L. Adair
  • Paul P. Gardner
Abstract

Metagenome studies are becoming increasingly widespread, yielding important insights into microbial communities across diverse environments, from terrestrial and aquatic ecosystems to human skin and gut. With the advent of high-throughput sequencing platforms, the use of large-scale shotgun sequencing approaches is now commonplace. However, a thorough independent benchmark comparing state-of-the-art metagenome analysis tools is lacking. Here, we present a benchmark where the most widely used tools are tested on complex, realistic data sets. Our results clearly show that the most widely used tools are not necessarily the most accurate, that the most accurate tool is not necessarily the most time-consuming, and that there is a high degree of variability between available tools. These findings are important as the conclusions of any metagenomics study are affected by errors in the predicted community composition and functional capacity. Data sets and results are freely available from http://www.ucbioinformatics.org/metabenchmark.html.

Journal title:

Volume 6, Issue 

Pages  -

Publication date: 2016